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Abstract — Any two-way finite state automaton is equivalent 
to some one-way finite state automaton. This well-known result, 
shown by Rabin and Scott and independently by Shepherdson, 
states that two-way finite state automata (even non-deterministic) 
characterize the class of regular languages. It is also known 
that this result does not extend to finite string transductions: 
(deterministic) two-way finite state transducers strictly extend 
the expressive power of (functional) one-way transducers. In 
particular deterministic two-way transducers capture exactly the 
class of MSO-transductions of finite strings. 

In this paper, we address the following definability problem: 
given a function defined by a two-way finite state transducer, is it 
definable by a one-way finite state transducer? By extending Ra- 
bin and Scott's proof to transductions, we show that this problem 
is decidable. Our procedure builds a one-way transducer, which 
is equivalent to the two-way transducer, whenever one exists. 

I. Introduction 

In formal language theory, the importance of a class of 
languages is often supported by the number and the diversity 
of its characterizations. One of the most famous example is 
the class of regular languages of finite strings, which enjoys, 
for instance, computational (automata), algebraic (syntactic 
congruence) and logical (monadic second order (MSO) logic 
with one successor) characterizations. The study of regular 
languages has been very influential and several generalizations 
have been established. Among the most notable ones are the 
extensions to infinite strings [lj and trees [2 |. On finite strings, 
it is well-known that both deterministic and non-deterministic 
finite state automata define regular languages. It is also well- 
known that the expressive power of finite state automata does 
not increase when the reading head can move left and right, 
even in presence of non-determinism. The latter class is known 
as non-deterministic two-way finite state automata and it is no 
more powerful than (one-way) finite state automata. The proof 
of this result was first shown in the seminal paper of Rabin 
and Scott 0, and independently by Shepherdson |0J. 

The comfortable picture of automata models over finite 
strings changes substantially when, instead of languages, string 
transductions, i.e. relations from strings to strings, are consid- 
ered. Transducers generalize automata as they are equipped 
with a one-way output tape. At each step they read an 
input symbol, they can append several symbols to the output 
tape. Their transition systems can be either deterministic or 
non-deterministic. Functional transducers are transducers that 
define functions instead of relations. For instance, determin- 
istic transducers are always functional. In this paper, we are 
interested in transducers that define functions, but that can be 
non-deterministic . 



As for automata, the reading head of transducers can 
move one-way (left-to-right) or two-way. ( One-way) finite 
state transducers have been extensively studied J5), (6J. Non- 
deterministic (even functional) one-way transducers (NFTs) 
strictly extend the expressive power of deterministic one-way 
transducers (DFTs), because some local transformations that 
depend on properties of the future of the input string can be 
defined using non-determinism. 

Two-way finite state transducers define regular transfor- 
mations that are beyond the expressive power of one-way 
transducers [7 |. They can for instance reverse an input string, 
swap two substrings or copy a substring. The transductions 
defined by two-way transducers have been characterized by 
other logical and computational models. Introduced by Cour- 
celle, monadic second-order definable transductions are trans- 
formations from graphs to graphs defined with the logic MSO 
1 8 1 . Engelfriet and Hoogeboom have shown that the monadic 
second-order definable functions are exactly the functions 
definable by deterministic two-way finite state transducers 
(2DFTs) when the graphs are restricted to finite strings (0. 
Recently, Alur and Cerny have characterized 2DFr-definable 
transductions by a deterministic one-way model called stream- 
ing string transducers iflOl and shown how they can be applied 
to the verification of list-processing programs lUTI . Streaming 
string transducers extend DFTs with a finite set of output string 
variables. At each step, their content can be reset or updated by 
either prepending or appending a finite string, or the content 
of another variable, in a copyless manner. Extending 2DFTs 
with non-determinism does not increase their expressive power 
when they define functions: non-deterministic two-way finite 
state transducers (2NFTs) that are functional define exactly 
the class of functions definable by 2DFTs |9|. To summarize, 
there is a strict hierarchy between DFT-, functional NFT- and 
2Z?fT-definable transductions. 

Several important problems are known to be decidable 
for one-way transducers. The functionality problem for NFT, 
decidable in PTime lfl2l . |[T3l . asks whether a given NFT is 
functional. The determinizability problem, also decidable in 
PTime lfl4ll . |[T3ll . asks whether a given functional NFT can 
be determinized, i.e. defines a subsequential function. Subse- 
quential functions are those functions that can be defined by 
DFTs equipped with an additional output function from final 
states to finite strings, which is used to append a last string 
to the output when the computation terminates successfully in 
some final state. Over strings that always end with a unique 
end marker, subsequential functions are exactly the functions 
definable by DFTs. For 2NFTs, the functionality problem is 



known to be decidable [15]. Therefore the determinizability 
problem is also decidable for 2NFTs, since functional 2NFTs 
and 2DFTs have the same expressive power. In the same line 
of research, we address a definability problem in this paper. 
In particular we answer the fundamental question of NFT- 
definability of transductions defined by functional 2NFTs. 

Theorem 1. For all functional 2NFTs T, it is decidable 
whether the transduction defined by T is definable by an NFT. 

The proof of Theorem Q] extends the proof of Rabin and 
Scott [3 1 from automata to transducers^. The original proof of 
Rabin and Scott is based on the following observation about 
the runs of two-way automata. Their shapes have a nesting 
structure: they are composed of many zigzags, each zigzag 
being itself composed of simpler zigzags. Basic zigzags are 
called z-motions as their shapes look like a Z. Rabin and Scott 
prove that for automata, it is always possible to replace a z- 
motion by a single pass. Then from a two-way automaton A 
it is possible to construct an equivalent two-way automaton B 
(called the squeeze of A) which is simpler in the following 
sense: accepting runs of B are those of A in which some z- 
motions have been replaced by single pass runs. Last, they 
argu^l that after a number of applications of this construction 
that depends only on the number of states of A, every zigzag 
can be removed, yielding an equivalent one-way automaton. 

The extension to 2NFTs faces the following additional 
difficulty: it is not always possible to replace a z-motion of a 
transducer by a single pass. Intuitively, this is due to the fact 
that 2NFTs are strictly more expressive than NFTs. As our 
aim is to decide when a 2NFT T is iVfT-definable, we need 
to prove that the ATr-definability of T implies that of every 
z-motion of T, to be able to apply the squeeze construction. 
The main technical contribution of this paper is thus the study 
of the AfFr-definability of z-motions of transducers. We show 
that this problem is decidable, and identify a characterization 
which allows to prove easily that the ATr-definability of T 
implies that of every z-motion of T. 

This characterization expresses requirements about the out- 
put strings produced along loops of z-motions. We show 
that when z-motions are AffT-definable, the output strings 
produced by the three passes on a loop are not arbitrary, but 
conjugates. This allows us to give a precise characterization of 
the form of these output strings. We show that it is decidable 
to check whether all outputs words have this form. Last, we 
present how to use this characterization to simulate an NFT- 
definable z-motion by a single pass. 

Applications By Theorem Q] and since functionality is decid- 
able for 2NFTs, it is also decidable, given a 2NFT, whether 
the transduction it defines is definable by a functional NFT. 
Another corollary of Theorem[T]and the fact that functionality 
of 2NFTs and determinizability of NFTs are both decidable is 

'Shepherdson |4] and then Vardi |16 | proposed arguably simpler con- 
structions for automata. It is however not clear to us how to extend these 
constructions to transducers. 

2 To our knowledge, there is no published proof of this result, thus we prove 
it in this paper as we use it for transducers. 



the following theorem: 

Theorem 2. For all 2NFTs T, it is decidable whether the 
transduction defined by T is a subsequential function. 

A more practical instance of definability problems is the 
verification of memory consumption for document processing 
applications in streaming, such as validation, queries and trans- 
formations of texts, lists and XML documents. In this scenario, 
the input data is received and processed as a stream of events. 
This huge amount of data is not stored, and minimal memory 
consumption is expected. Streamable operations are operations 
that can be evaluated with constant memory, independently 
of the input size. The streamability problem, which asks 
whether a given processing task is streamable, can be seen 
as a particular class of definability problems. For instance, 
streamability of validation against a DTD amounts to decide, 
given a DTD, whether there exists an equivalent finite state 
automaton, when restricted to well-formed XML documents. 
It has been partially solved in ifTTl . fl8l . Streamability of 
XML queries |19| and transformations lEol also started to 
be addressed. Over finite strings, a transformation defined by 
a functional NFT T is streamable iff the transduction defined 
by T is subsequential, which is decidable. Theorem [2] can be 
seen as a generalization of this decidability result to regular 
transformations, i.e. transformations defined by functional 
2NFTs, MSO transducers or streaming string transducers. 

Related work Most of the related work has already been 
mentioned. To the best of our knowledge, it is the first result 
that addresses a definability problem between two-way and 
one-way transducers. In [21], two-way transducers with a two- 
way output tape are introduced with a special output policy: 
each time a cell at position % of the input tape is processed, 
the output is written in the cell at position i of the output tape. 
With that restriction, it is shown that two-way and one-way 
transducers (NFTs) define the same class of functions. In ll22l . 
the result of Rabin and Scott, and Shepherdson, is extended to 
two-way automata with multiplicities. In this context, two-way 
automata strictly extend one-way automata. 

Organization of the paper Section [II] introduces necessary 
preliminary definitions. In Section [III] we describe the general 
decision procedure for testing A^r-definability of functional 
2NFTs. We introduce z-motion transductions induced by 
2NFTs and show that their M*T-definability is necessary. The 
decidability of this necessary condition as well as the construc- 
tion from z-motion transducers to NFTs are the most technical 
results of this paper and are the subject of Section [IV] We 
finally discuss side results and further questions in Section [VI 

II. One- Way and Two- Way Finite State Machines 

Words, Languages and Transductions Given a finite alpha- 
bet £, we denote by S* the set of finite words over E, and 
by e the empty word. The length of a word u G E* is its 
number of symbols, denoted by For all i € {1, ... , |u|}, 
we denote by u[i] the i-th letter of u. Given 1 < i < j < \u\, 
we denote by the word u[i]it[i+l] . . .u[j] and by 



the word it[j]u[j — 1] . . .u[i]. We say that v G E* is a factor 
of u if there exist u\, u-i G E* such that u = u-yvui- By u we 
denote the mirror of «, i.e. the word of length \u\ such that 
u[i] = u[\u\ - i + 1] for all 1 < i < \u\. 

The primitive root of u G E* is the shortest word w such 
that u = tr for some integer fc > 1, and is denoted by p,(u). 
Two words u and t> are conjugates, denoted by ~, if there 
exist iji/eS* such that u — xy and w = yrr, i.e. u can be 
obtained from v by a cyclic permutation. Note that ~ is an 
equivalence relation. We will use this fundamental lemma: 

Lemma 1 ( 1231 ). Let u, v G E*. fnere exists n > smc/i 
f/iaf u™ ant/ w™ have a common factor of length at least \u\ + 
\v\ — gcd(\u\, \v\), then fi(u) ~ /^(w). 

Note that if p,(u) ~ p,(v), then there exist x, y G E* such 
that u G (zy)* and w G (ya;)*- 

A language over E is a set i C E*. A transduction over 
E is a relation i? C E* x E*. Its domain is denoted by 
dom(i?), i.e. dom(i?) = {u \ 3v, (u, v) G i?}, while its image 
{« | 3u, (u,v) G i?} is denoted by img(R). A transduction 
R is functional if it is a function. 

Automata A non-deterministic two-way finite state automaton 
(2NFA) over a finite alphabet E is a tuple A = (Q, go, -F, A) 
where Q is a finite set of states, qo G Q is the initial state, 
F C Q is a set of final states, and A is the transition relation, 
of type A C Q x E x Q x {+1, — 1}0 It is deterministic if 
for all (p, a) G Q x E, there is at most one pair (q, m) G 
Q x {+1, —1} such that (p, a, q, m) G A. In order to see how 
words are evaluated by A, it is convenient to see the input as 
a right-infinite input tape containing the word (starting at the 
first cell) followed by blank symbols. Initially the head of A 
is on the first cell in state qo (the cell at position 1). When A 
reads an input symbol, depending on the transitions in A, its 
head moves to the left (— 1) if the head was not in the first 
cell, or to the right (+1) and changes its state. A stops as soon 
as it reaches a blank symbol (therefore at the right of the input 
word), and the word is accepted if the current state is final. 

A configuration of A is a pair (q, i) G Q x (N — {0}) 
where q is a state and i is a position on the input tape. A 
run p of A is a finite sequence of configurations. The run 
p = (pi,ix) . . . (p m , im) is a run on an input word u G E* 
of length n if p\ = qo, i\ = 1, i m < n + 1, and for all 
k G {l,...,m- 1}, 1 < ik < n and (p k ,u[i k ],p k+1 ,i k+1 - 
i k ) G A. It is accepting if i m = n + 1 and p m G F. The 
language of a 2NFA A, denoted by L(A), is the set of words 
u such that there exists an accepting run of A on u. 

A non-deterministic (one-way) finite state automaton (NFA) 
is a 2NFA such that ACQxExQx {+1}, therefore we 
will often see A as a subset of Q x E x Q. Any 2NFA is 
effectively equivalent to an NFA. It was first proved by Rabin 
and Scott, and independently by Shepherdson J4). 
Transducers N on- deterministic two-way finite state transduc- 
ers (2NFTs) over E extend NFAs with a one-way left-to- 

3 We follow the definition of Vardi 1161 . but without stay transitions. This 
is without loss of generality though. 



right output tape. They are defined as 2NFAs except that 
the transition relation A is extended with outputs: A C 
QxExE*xQx {— 1,-1-1}. If a transition (q,a,v,q' , m) 
is fired on a letter a, the word v is appended to the right of 
the output tape and the transducer goes to state q'. Wlog we 
assume that for all p, q G Q, a G E and m G {+1, —1}, there 
exists at most one v G E* such that (p, a, v, q, m) G A. We 
also denote v by out(p, a, q, m). 

A run of a 2NFTs is a run of its underlying automaton, 
i.e. the 2NFAs obtained by ignoring the output. A run p 
may be simultaneously a run on a word u and on a word 
v! ^ u. However, when the underlying input word is given, 
there is a unique sequence of transitions associated with 
p. Given a 2NFT T, an input word u G E* and a run 
p = (pi, i\) . . . (p m , im) of T on u, the output of p on it, 
denoted by o\\t u (p), is the word obtained by concatenating 
the outputs of the transitions followed by p, i.e. out"(p) = 
out(pi,w[zi],p 2 ,«2-ii) • • ■out(p m _i,u[i m _i],p m ,i m -i m _i) 
If ,0 contains a single configuration, we let 
out u (p) = e. When the underlying input word u is 
clear from the context, we may omit the exponent 
u. The transduction defined by T is the relation 
^(^) = ou t"(p)) | p is an accepting run of T on m}. 
We may often just write it T when it is clear from the 
context. A 2NFT T is functional if the transduction it defines 
is functional. The class of functional 2NFTs is denoted by 
f2NFT. In this paper, we mainly focus on f2NFTs. The 
domain of T is defined as dom(T) = dom(R(T)). The 
domain dom(T) is a regular language that can be defined 
by the 2NFA obtained by projecting away the output part of 
the transitions of T, called the underlying input automaton. 
A deterministic two-way finite state transducer (2DFT) is a 
2NFT whose underlying input automaton is deterministic. 
Note that 2DFTs are always functional, as there is at most one 
accepting run per input word. A non-deterministic (one-way) 
finite state transducer (NFT) is a 2NFT whose underlying 
automaton is an Wvfl It is deterministic (written DFT) if 
the underlying automaton is a DFA. 

We say that two transducers T, T 1 are equivalent, denoted 
by T = T' , whenever they define the same transduction, i.e. 
R{T) = R(T'). For all transducer classes C, we say that a 
transduction R C E* x E* is C-definable if there exists TgC 
such that R—R(T). Given two classes C, C of transducers, and 
a transducer T G C, we say that T is (effectively) C'-definable 
if one can construct an equivalent transducer T" G C. 

The (C,C)- definability problem takes as input a transducer 
TgC and asks to decide whether T is C'-definable. If so, one 
may want to construct an equivalent transducer T" G C. In this 
paper, we prove that (J2NFT,NFT) -definability is decidable. 

It is known that whether an NFT T is functional can be 
decided in PTime [12]. The class of functional NFTs is denoted 
by JNFT. Functional NFTs are strictly more expressive than 
DFTs. For instance, the function that maps any word u G 

4 This definition implies that there is no e-transitions that can produce 
outputs, which may cause the image of an input word to be an infinite 
language. Those NFTs are sometimes called real-time in the literature. 



{a, b}+ to aH if u[\u\] = a, and to otherwise, is fNFT- 
definable but not DfT-definable. This result does not hold for 
2NFTs: functional 2NFTs and 2DFTs define the same class of 
transductions (Thm 22 of [9|). 

Examples Let E = {a, b} and # g" E, and consider the 
transductions 

1) R = {(u,a |M| ) | u G E+ v[|u|] = a} 

2) R 1 = | u G E+,u[|u|] = b}(JR Q 

3) i? 2 = {(#«#, | m S E*}. 

i?o is Z)Tr-definable: it suffices to replace each 
letter by a and to accept only if the last letter 
is a. Therefore it can be defined by the DFT 

To={{ia,qb},qbA < ia\A{ix,y,a,qy) I x,yeE}). 

Ri is /TVTT-definable but not DfT-definable: 
similarly as before we can define a DFT Tq = 
({Pa,Pb},Pa, {Pb}, {(Px,y, b,p v ) | x,y £ E}) that defines the 
transduction {(it, foH) | u G E + , u[\u\] = b}, and construct an 
NFT Xi as follows: its initial state is some fresh state po> an d 
when reading a; € E the first time, it non-deterministically 
goes to To or Tq by taking the transition (po,x,a,q x ) or 
(po,x,b,p x ), and proceeds in either To or Tq. Even if R\ 
is functional, it is not DfT-definable, as the transformation 
depends on the property of the last letter, which can be 
arbitrarily faraway from the beginning of the string. 

R 2 is 2DfT-definable: it suffices to go to the end of the 
word by producing e each time a letter is read, to go back 
to the beginning while copying each input letter, and return 
to the end without outputting anything, and to accept. Hence 
it is defined by T 2 = ({q , q 1} q 2 , q s , g/}, q , {g/}, 5 2 ) where 
states gi, 52, ?3 denote passes, and 5 2 is made of the transitions 
(g ,#, e,gi,-|-l), {qi,x£'S,e,qi,+l) (during the first pass, 
move to the right), (g x , #, e, q 2 , -1), (q 2 , xGE, x, q 2 , -1), 
(?2)#,#)?3,+l), (?3,a;eE,e,(?3,+l), (g 3 , #, #, g /; +1). 

Crossing Sequences, Loops and Finite-Crossing 2NFTs The 
notion of crossing sequence is a useful notion in the theory 
of two-way automata 2), l24l . that allows one to pump runs 
of two-way automata. Given a 2iVM A, a word u G E* and a 
run p of A on u, the crossing sequence at position i, denoted 
by CS(p, i) is given by the sequence of states q such that 
(q,i) occurs in p. The order of the sequence is given by 
the order in which the pairs of the form (q, i) occur in p. 
E.g. if p = {q u l)(g 2 , 2)(g 3 , l)(g 4 , 2)(< 7s , l)(g fl) 2)(g 7 , 3) then 
CS{p,l) = qiq 3 q5, CS(p,2) = q 2 q±q§ and CS(p,3) = q 7 . 
We write CS(p) the sequence ^(p, 1), . . . , CS{p, \u\ + 1). 

Crossing sequences allow one to define the loops of a run. 
Given a run p of the 2NFA A on some word u of length n, a 
pair of positions is a Zoop[f| in p if (i) 1 < i < j < n, 
(ii) CS{p,i) = CS(p,j) and (in) u[i] — u[j]. Let u\ = 
u[l..(i — 1)], u 2 — u[i..(j — 1)] and u 3 — u[j..n]. If is a 
loop in p and u £ then ui(u 2 ) k U3 G T(^4) for all k>0. 

We say that a loop is empty if i = j, in this case we 

5 Observe that we include the input letter in the notion of loop. We use this 
to avoid technical difficulties due to backward transitions (which do not read 
the local symbol, but its successor). 
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Fig. 1. Zigzags removal by applications of squeeze. 

have u 2 = s. The notions of crossing sequence and loop carry 
over to transducers through their underlying input automata. 

Given a 2NFT T, N G N and a run p of T on a word 
of length n, p is said to be iV-crossing if \CS(p,i)\ < N 
for all i G {1, . . . ,n}. The transducer T is finite-crossing if 
there exists N G N such that for all (u, v) G R(T), there is an 
accepting TV-crossing run p on u such that out(p) = v. In that 
case, T is said to be ./V-crossing. It is easy to see that if T is N- 
crossing, then for all (u, v) G R(T) there is an accepting run 
p on u such that out(p) = v and no states repeat in CS(p, i) 
for alH G {1, . . . , |w|}. Indeed, if some state q repeats in some 
CS(p, i), then it is possible to pump the subrun between the 
two occurrences of q on CS(p,i). This subrun has an empty 
output, otherwise T would not be functional. 

Proposition 1. Any f2NFT with N states is N -crossing. 

III. From Two-way to One-way Transducers 

In this section, we prove the main result of this paper, i.e. 
the decidability of (f2NFT,NFT) -definability. 

A. Rabin and Scott's Construction for Automata 

The proof of Theorem Q] relies on the same ideas as Rabin 
and Scott's construction for automata 0. It is based on the 
following key observation: Any accepting run is made of 
many zigzags, and those zigzags are organized by a nesting 
hierarchy: zigzag patterns may be composed of simpler zigzag 
patterns. The simplest zigzags of the hierarchy are those that 
do not nest any other zigzag: they are called z-motions. Rabin 
and Scott described a procedure that removes those zigzags 
by iterating a construction that removes z-motions. 

A one-step sequence is an indexed sequence s = a\, . . . , a n 
of positions such that aj G {l,2...,m}, a% — 1, a n = m, 
and \a,i+i — Oj| = l.The sequence s is N -crossing if for all 
x G {1,2..., m} we have \{i | at = x}\ < N. The reversals 



of s are the indexes 1 < r% < T2 < ■ ■ ■ < rj < n such that 
a ri+ i — a n -%. In the sequel we let fo = 1 and r/+i = n. 

A z-motion z in s is a subsequence a e , a e +x, ■■■ a f sucn that 
there is < i < I with r<_i < e < < r,+i < f < 
and a e = a n+1 and a/ = a n . We may denote z by the pair of 
reversals fa,r i+ i). E.g. the sequences Z\ — 1,2,3,2,1,2,3 
and Z2 — 4, 3, 2, 3, 4, 3, 2 are z-motions. The shape of a run p 
is defined as the second projection of p, written shape(p). A 
run p is a z-motion run if shape(p) is a z-motion. When there 
is no ambiguity, z-motion runs are just called z-motions. 

If T is a 2NFA, it is possible to construct a new automaton 
denoted by squeeze(T) such that, for all accepting runs p of T 
on some input word u, there exists a "simpler" accepting run 
of squeeze(T) on u, obtained from p by replacing some z- 
motions by one-way runs that simulate three passes in parallel. 
It is illustrated by Fig. Q] For instance at the first step, there 
are two z-motions from q% to 92 and from 93 to 94 respec- 
tively. Applying squeeze(T) consists in non-deterministically 
guessing those z-motions and simulating them by one-way 
runs. This is done by the NFA i?<r(<7i, 92) and Rt(<13,<14,) 
respectively. Depending on whether the z-motions enter from 
the left or the right, z-motions are replaced by runs of NFAs 
Rt(-, •) (that read the input backwardly) or Lt(-, •) , as 
illustrated by the second iteration of squeeze on Fig. Q] 

An iV-crossing run p can be simplified into a one-way run 
after a constant number of applications of squeeze. This result 
is unpublished so we prove it in this paper. In particular, we 
show that if p is ^-crossing, then its zigzag nesting depth 
decreases after N steps. Moreover, if p is TV-crossing, then its 
zigzag nesting depth is also bounded by N. Therefore after 
N 2 applications of squeeze, p is transformed into a simple 
one-way run. It is sufficient to prove those results at the level 
of integer sequences. In particular, one can define squeeze(s) 
the set of sequences obtained from a one-step sequence s 
by replacing some z-motions of s by strictly increasing or 
decreasing subsequences. The following is formalized and 
shown in Appendix: 

Lemma 2. Let s be an N -crossing one-step sequence over 
{1, . . . , m}. Then 1,2, ... ,m is in squeeze^ (s). 

At the automata level, it is known that for all words u 
accepted by a 2NFA T with N states, there exists an TV- 
crossing accepting run on u. Therefore it suffices to apply 
squeeze N 2 times to T. One gets an equivalent 2NFA T* 
from which the backward transitions can be removed while 
preserving equivalence with T*, and so T. 

B. Extension to transducers: overview 

The construction used to show decidability of NFT- 
definability of f2NFT follows the same ideas as Rabin and 
Scott's construction. The main difference relies in the trans- 
formation of the local transducers defined by z-motion runs 
(that we call ZNFTs) into NFTs. Our procedure is built over 
a ZNFT-to-NFT procedure. It is seen as a black-box in this 
section, but is the subject of the next section. 
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Compared to two-way automata, one faces an extra diffi- 
culty caused by the fact that 2NFTs (and ZNFTs) are not 
always A^r-definable. Therefore one defines a necessary 
condition that has to be tested each time we want to apply 
squeeze. Let us consider again Fig.[T]when T is a 2NFT. One 
defines from T the transductions induced by local z-motion 
runs from a starting state qi to an ending state (72, and show 
that those local transductions must be A'Fr-definable. 

Once this necessary condition is satisfied, the construction 
squeeze can be applied and works as for Rabin and Scott's 
construction: the new transducer squeeze(T) simulates T and 
non-deterministically may guess that the next zigzag of T 
is a z-motion run from some state qi to some state q2, and 
thus can be simulated by a run of some NFT Rt (91,92) or 
Lx(qi, 92), depending on whether it enters from the left or the 
right. Then squeeze(T) switches to i?T (91,92) (if it entered 
from the right) and once Rx(qi, 92) reaches an accepting state, 
it may come back to its normal mode. 

C. z-motion transducers 

z-motion transducers are defined like 2NFTs except that 
they must define functions and to be accepting, a run on 
a word of length n must be of the form p.(qf,n + 1) 
where p is a z-motion run and 9/ is an accepting state. 
Note that it implies that shape^) is always of the form 
1, . . . , n, n— 1, . . . ,1, . . . ,n. The class of z-motion transducers 
is denoted by ZNFTs. Note that z-motion transducers are 
incomparable with f2NFTs. Indeed, z-motion transducers can 
for instance define the transduction u 6 E* h-> u, which is not 
/2ATr-definable as there are no ending markers. 

Let T e ZNFT and p = (pi, 1) . . . (p n , n) 
(g„_i,n-l)...(9i,l)(r2,2)...(r n+ i,n + 1) be a run 
of T on a word of length n. We let q n = p n and r\ = 91 
and define the following shortcuts: for 1 < i < j < n, 
outi[i,j] = out((pi,i) . . . (pj,j)), and out 2 [i,j] = 
out((9 J; j) . . . (qui)) and out 3 [z,j] = out (fa, i) . . . (r , j)), 
and out 3 [i,n + 1] = out(fa, i) . . . (r n+ \, n + 1)). In the 
sequel, any accepting run p on a word of length n will have 
the previous form. 

We characterize the ATT-definability of a ZNFT by a prop- 
erty that we prove to be decidable. Intuitively, this property 
requires that the outputs produced by loops can be produced 
by a single forward pass: 

Definition f CP-property). Let T be a ZNFT. We say that T 
satisfies the property V, denoted by T \= V, if for all words 
u € dom(T), for all accepting runs p on u, and for all pairs 
of loops (i\,ji) and (12,32) of p such that j\ < i% there 
exist Pi, /?2, 03, Pi, $5 S £*, f,9 ■ N 2 — > S* and constants 



Ci, c' l7 C2, c 2 > such that cx,c 2 ^ and for all ki,k 2 > 0, 
/(fei, k 2 )x v 7 l 1 xiw^ 2 X2W2 2 X3V2 1 X4V3 1 x 5 Wz 2 x e g(ki,k 2 ) 

where r\i — fcjCj + c^, i G {1,2}, ana", a^'s, Uj'i ant/ w^s are 
words defined as depicted in Fig. [2] 

The following lemma will be proved in Section HVl 

Lemma 3. Let T G ZNFT. T ^ V iff T is NFT- definable. 
Moreover, V is decidable and ifT\=V, one can (effectively) 
construct an equivalent NFT. 

Definition 2 (z-motion transductions induced by a 2NFT). 

Let T = (Q,q Q ,F,A) be a f2NFT and 91,92 G Q. The 
transduction £7(91,92) (resp. TZt (91,92)) is defined as the 
set of pairs (1*2, v 2 ) such that there exist u G £*, two positions 
i\ < i 2 (resp. i 2 < an accepting run p of T on u, which 
can be decomposed as p = pi(qi,ii)p 2 (q 2 ,i 2 )p3 such that 
u 2 = u[ii..i 2 ] and 

• 21)^2(92, 12) is a z-motion run 

. out((q 1 ,i 1 )p 2 (q 2 ,i 2 )) = v 2 

z-motions can be of two forms: either they start from the 
left and end to the right, or start from the right and end to the 
left. In order to avoid considering these two cases each time, 
we introduce the notation T that denotes the mirror of T: it is 
T where the moves +1 are replaced by —1 and the moves — 1 
by +1. Moreover, the way T reads the input tape is slightly 
modified: it starts in position n and a run is accepting if it 
reaches position in some accepting state. All the notions 
defined for 2NFTs carry over to their mirrors. In particular, 
(u, v) G R(T) iff (u, v) G R{T). The z-motion transductions 
7vLt(<?i, 92) and £7(91,92) are symmetric in the following 
sense: K T (qi,q 2 ) = £7(91, q 2 ) and £7(91, q 2 ) = TZ^iqi, 92). 

Proposition 2. The transductions TZr(qx, q 2 ) and £7(91,92) 
are ZNFT-definable. 

Proof: We only consider the case £7(91,92), the other 
case being solved by using the equality lZT{qi,q 2 ) = 
£,Tf(qi,q 2 ). We first construct from T a ZNFT Z T (qi,q 2 ) 
which is like T but its initial state is q\, and it can move to an 
accepting state whenever it is in q 2 . However Z T (qi,q 2 ) may 
define input/output pairs (u 2l v 2 ) that cannot be embedded 
into some pair (u, v) G R(T) as required by the definition of 
£r(9i> 92)- Based on Shepherdson's construction, we modify 
Z' T {qx,q 2 ) in order to take this constraint into account. The 
full proof is in Appendix. ■ 
In the next subsection, we show that TZr{qi,q 2 ) and 
£7 (91 j q 2 ) must necessarily be A^FF-definable for T to be 
A^r-definable. For that purpose, it is crucial in Definition 
12 to make sure that the z-motion (qi,ii)p 2 (q 2 ,i 2 ) can be 
embedded into a global accepting run of T. Without that 
restriction, it might be the case that Cr{qi, q 2 ) or TZr{qi,q 2 ) 
is not ATr-definable although the 2NFT T is. Indeed, the 
domain of £t(<7i,<?2) or lZT{qi,q 2 ) would be too permissive 
and accept words that would be otherwise rejected by other 
passes of global runs of T. This is another difficulty when 



lifting Rabin and Scott's proof to transducers, as for automata, 
the context in which a z-motion occurs is not important. 

D. Decision procedure and proof of Theorem Q] 

We will show that the construction squeeze(T) can be 
applied if the following necessary condition is satisfied. 

Lemma 4 (Necessary Condition). If T is NFT -definable, 
then so are the transductions TZt(qi,Q2) and £7^(91,92) 
for all states q\ , q 2 . Moreover, it is decidable whether the 
transductions TZt(qi, Q2) and £t(<717<72) are NFT -definable. 

Sketch of proof: We have seen in Lemma [3] that NFT- 
definability of an ZNFT is characterized by Property V. Let 
Z G ZNFT that defines CT(qi,q 2 ) f° r some qi,q 2 , we will 
thus sketch the proof that Z \= V . 

Consider two loops (i 2 , 32) of a run p of Z on 

some word u, as in the premises of Property V. They induce 
a decomposition of u as u = U1M2M3U4M5 with u 2 = u[i±..ji— 
1] and U4 — u[i 2 ..j 2 — 1]. By definition of the transduction 
Ct(qi, 92)1 any word in dom(Z) can be extended into a word 
in dom(T). By hypothesis, T is M*T-definable, thus there 
exists an equivalent NFT T'. As T" has finitely many states, 
it is possible, by iterating the loops (ii,jx) and (i 2 ,j 2 ), to 
identify an input word of the form 

/ Cl C9 C^i C 1 C o Cl i I 

u = OLU\u 2 u 2 u 2 u-iU^u^u^u^a 

and a run p' of T' on this word which has two loops on the 
input subwords u c 2 2 and u£ . It is then easy to conclude. ■ 

Construction of squeeze(T) Assuming that the necessary 
condition is satisfied, we now explain how to construct 
the flNFT squeeze(T). By hypothesis, the transductions 
£t(<7i,<72) and TZt (91,92) are M*T-definable for all 91,92 
by NFT Lt (91,92) and Rr{qi,q2) respectively (they exist 
by Proposition |2] and Lemma |3J. As already said before, the 
main idea to define squeeze (T) is to non-deterministically 
(but repeatedly) apply Lx{qi, 92), Rt(qi, 12), or T, for some 
91,92 G Q. However when applying RT(qi,q 2 ), the head of 
squeeze(T) should move from the right to the left, so that we 
have to mirror the transitions of RT(qi,q 2 ). 

The transducer squeeze(T) has two modes, Z-mode or T- 
mode. In T-mode, it works as T until it non-deterministically 
decides that the next zigzag is a z-motion from some state 
91 to some state 92. Then it goes in Z-mode and runs 
£7(91,92) or it/r(9i,92), in which transitions to an accepting 
state have been replaced by transitions from q 2 in T, so that 
squeeze(T) returns in T-mode. From those transitions we 
also add transitions from the initial states of £7(92,93) and 
Rt(<12, 93) for all (73 G Q, in case squeeze(T) guesses that the 
next z-motion starts immediately at the end of the previous z- 
motion. We detail the construction of squeeze(T) in Appendix. 

Proposition 3. Let T G flNFT such that T is NFT-definable. 
Then squeeze(£) is defined and equivalent to T. 

Let T G J2NFT. If T is M^T-definable, then the operator 
squeeze can be iterated on T while preserving equivalence 
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Fig. 3. From ZNFT to JVFT. 

with T, by the latter proposition. By Proposition [T] T is TV- 
crossing, and therefore, based on Lemma|2] it suffices to iterate 
squeeze TV 2 times to remove all zigzags from accepting runs 
of T, as stated by the following lemma: 

Lemma 5. Let T be a flNFT with TV states. If T is JNFT- 
definable, then squeeze^ (T) is defined and equivalent to T, 
and moreover, for all (u, v) G R{T), there exists an accepting 
run p of squeeze^ (T) on u such that out(p) = v and p is 
made of forward transitions only. 

Proof of Theorem Q] In order to decide whether a flNFT T 
is TVfT-definable, it suffices to test whether squeeze can be 
applied TV 2 times. More precisely, it suffices to set To to T, 
i to 0, and, while Tj satisfies the necessary condition (which 
is decidable by Lemma HI and i < TV 2 , to increase i and set 
Ti to squeeze(Ti_i). If the procedure exists the loops before 
reaching TV 2 , then T is not TVfT-definable, otherwise it is 
TVFr-definable by the NFT obtained by removing from T N 2 
all its backward transitions. 

IV. From Elementary Zigzags to Lines 

This section is devoted to the proof of Lemma [3] that 
characterizes TVFr-definable ZNFT by the property V and 
states its decidability. Moreover, we give a ZNFT-to-NFT 
construction when V is satisfied. 

We will first prove that Property V is a necessary condition 
for TvT-T-definability. To prove the converse, we will proceed 
in two steps, as it is simpler than in a single step Q First, 
we define a procedure that tests whether a given ZNFT T 
is equivalent to a ZNFT that does not output anything on 
its backward pass (called eZNFT), and then define another 
procedure that tests whether the latter ZNFT is equivalent to 
an NFT. We show that it is always true whenever T \= V. 
This approach is depicted in Fig. [3] The two steps are similar, 
therefore we mainly focus on the first step. 

A. Property V is a necessary condition 

We show that Property V only depends on transductions. 

Lemma 6. Let T, T'eZNFT. IfT\=V and T=T' then T'\=V. 

Proof: Consider two loops ji), (12, 32) as m Property 
V in a run of T" on some word u. They induce a decomposition 

of u as u — U1M2U3M4U5 where 112 — u[i\..(ji — 1)] and 
U4 = u[i2--(j2 — 1)]> with U1U2U3U4U5 G dom(T') for all 
fci,fc 2 >0. 

As T is equivalent to X" and has finitely many states, there 
exist iterations of the loops on U2 and 7/4 which constitute 

6 We could proceed in a single step but the analysis of the output words 
would be much more complex. 
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Fig. 4. Decomposition of the output according to Property V\. 

loops in T on powers of U2 and 114. Formally, there exist 
integers dx,ei,hx,d2,e.2,h,2 with e\,e2 > such that T has 
a run p on the input word u\u 2 l U 2 ^U2Uzu'^u e ^u i 2 u^ which 
contains a loop on the input subwords u^ 1 and u^ 2 . 

We conclude easily by using the fact that T |= V. ■ 
As a consequence, we obtain that Property V is a necessary 
condition for TV^T-definability. 

Lemma l.LetT G ZNFT. IfT is NFT -definable, then T\=P. 

Proof: Let T" be an NFT equivalent to T. It is easy to 
turn T' into a ZNFT T" that performs two additional backward 
and forward passes which output e. Consider two loops {ix,ji) 
and {31, 32) in a run of T", and let us write the output of this 
run as depicted on Fig. [2] These loops are also loops of T', 
and thus we can define /3\ (resp. 02, 03, /?4 and j3§) as Xq 
(resp. V\, x\, W\ and X2), and /, g as the constant mappings 
equal to e. Hence T" \= V, and we conclude by Lemma [6] ■ 

B. From ZNFT to eZNFT 

The goal is to devise a procedure that tests whether the 
first and second passes (forward and backward) of the run 
can be done with a single forward pass, and constructs an 
NFT that realizes this single forward pass. Then, in order to 
obtain an eZNFT, it suffices to replace the first pass of T 
by the latter NFT and add a backward pass that just comes 
back to the beginning of the word and outputs e all the time. 
The procedure constructs an eZNFT, and tests whether it is 
equivalent to T. It is based on the following key property 
that characterizes the form of the output words of the two 
first passes of any ZNFT satisfying V. Intuitively, when these 
words are long enough, they can be decomposed as words 
whose primitive roots are conjugate. 

Definition 3 (Pi -property). Let T G ZNFT with m states, and 
let (u,v) G R{T) where u has length n. Let K = 2.o.m 3 .|E| 
where o = max{\v\ \ {p,a,v,q,m) G A}. The pair (u,v) 
satisfies the property V\, denoted by (u, v) |= Vi, if for all 
accepting runs p on u, there exist a position 1 < t < n and 
w, w' , t±, t2, £3 G S* such that v G wtxt^t^w' and: 

outi[l,^] = w out2 [!.,•£] = £3 outi[i,n]out2[i,n]Etit2 
out 8 [l,n+ 1] = w 1 \U\ < 2K.yi G {1,2,3} 

This decomposition is depicted in Fig. [5] T satisfies property 
V\, denoted T \= V\, if all (u,v) G R{T) satisfy it. 

Proposition 4. Let T G ZNFT. IfT\=V, then T \= V\. 

Proof: • If |out2[l,n — 1]| < K, then clearly, it suffices 
to take £ = n, t\ = out 2 [ra— 1, n], <2 = e, £3 = out 2 [0, n— 1], 
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Fig. 5. Decomposition of the two first passes of a z-motion run with loop. 

w = outi [1, n] and w' = out3[l, n + 1]. 
• Otherwise, |out2[l,n — 1]| > K. Therefore u is of length 
2.m 3 .|E| at least and there exists a (non-empty) loop in 
p. We can always choose this loop such that |out2 [1, *] | < K 
and 1 < |out2 j] < K (see Lemma [T6l in Appendix). 

The loop partitions the input and output words into factors 
that are depicted in Fig. [5] (only the two first passes are de- 
picted). Formally, let u = uiu 2 u 3 such that u 2 — u[i..(j— 1)]. 
Let £0 = outi[l,i], vi = outi[i,j], x\ = outi[j, n]out2[j, n], 
v 2 = out 2 [i,j], x 2 = outi[l,i], x 3 = out 3 [l,i], v 3 = 
out3[z,j] and £4 = out 3 [j, n+1]. In particular, we have |x 2 | < 
K,l < \v 2 \ < K and xoViXi?;2X2X3i>4X4 € T(u). Since 
is a loop we also get xqv\ xiv 2 x 2 x 3 v 3 X4 6 T(uiu 2 u 3 ) for 
all k > 0. We then distinguish two cases: 

1) If vi 7^ e. We can apply Property V by taking the second 
loop empty. We get that for all k > 

/(fc)x < +c ' xiv* c+c ' x 2 x 3 v k 3 c+c ' x 4 g(k) = Wife 

where f,g : N -> S*, c £ N >0 , c' £ N, and Pi,p2,Ps £ £*• 
Since the above equality holds for all fc > 0, we can apply 
Lemma[T]and we get fi(vi) ~ /x(/?2) an d (^(^2) ~ ^{ v 2)< an d 
therefore ~ n(v 2 ). So there exist x,y £ S* such that 

Ui G (#2/)* and ^2 £ {yx)* ■ One can show (see Lemma [T71 
in Appendix) that vixii^ £ x(yx)*. Then it suffices to take 
£ = i, w = xo, ti = x, ^2 = 2/- T an d *3 = x 2- 

2) The second case (vi = e) is more complicated as it 
requires to use the full Property V, using two non-empty loops. 
First, we distinguish two cases whether |outi[7, n]\ < K or 
not. For the latter case, we identify a second loop and then 
apply Property V. Details can be found in the Appendix|B] ■ 

Construction of an eZNFT from a ZNFT We construct an 
eZNFT T from a ZNFT T such that R(T') = {(u,v) £ 
R(T) I (u,v) \= Vi}. Intuitively, the main idea is to perform 
the two first passes in a single forward pass, followed by 
a non-producing backward pass, and the final third pass is 
exactly as T does. Therefore, T" guesses the words ti,t 2 
and ta and makes sure that the output v is indeed of the 
form characterized by V\. This can be done in a one-way 
fashion while simulating the forward and backward passes in 
parallel and by guessing non-deterministic ally the position I. 
In addition, the output mechanism of T" exploits the special 
form of v: the idea is to output powers of t 2 while simulating 
the two first passes. 

First, let us describe how T" simulates the forward and 
backward passes in parallel during the first forward pass. It 



guesses both the state of the backward pass, and the current 
symbol (this is needed as the symbol read by the backward 
transition is the next symbol). The first state (q*) guessed for 
the backward pass needs to be stored, as the last (forward) 
pass should start from q*. The transducer can go from state 
(p, q, a) to state (p', q' , a') if the current symbol is a and there 
is a (forward) transition (p, a, x,p', +1) and a (backward) 
transition (q 1 , a 1 ,y, q, —1). Therefore if Q is the set of states 
of T, T' will use, on the first pass, elements of Q x Q x S in 
its states. The transducer T' can non-deterministically decide 
to perform the backward and non-producing backward pass 
whenever it is in some state (q, q, a) and the current symbol 
is a. This indeed happens precisely when the forward and 
backward passes are in the same state q. If the current symbol 
is not the last of the input word, then the whole run of T' will 
not be a z-motion and therefore will not be accepting. 

Second, we describe how the eZNFT T', with the guess of 
ti,t 2 ,tz, verifies during its first forward pass that the output 
has the expected form, and how it produces this output. During 
the first pass, T" can be in two modes: In mode 1 (before the 
guess £), T' verifies that the output on the simulated backward 
pass is i 3 and proceeds as T in the first forward pass (it outputs 
what T outputs on the forward pass). Mode 2 starts when the 
guess £ has been made. In this mode, T" first outputs ti and 
then verifies that the output of the forward/backward run from 
and to position I is of the form t\t%. It can be done by using 
pointers on t\ and t 2 . There are two cases (guessed by T')\ 
either ti ends during the forward pass or during the backward 
pass (using notations of Figj4] either t\ is a prefix of x, or x 
is a prefix of ti). 

In the first case, T" needs a pointer on ti to make sure that 
the output of T in the forward pass starts with t\. It also needs 
a pointer on t 2 , initially at the end of t 2 , to make sure that the 
output of T on the simulated backward pass is a suffix of t* 2 
(the pointer moves backward, coming back to the last position 
of t 2 whenever it reaches the first position of t 2 ). Once the 
verification on ti is done, T' starts, by using a pointer initially 
at the first position in t 2 , to verify that the output of T in 
the forward pass is a prefix of Once the forward and the 
simulated backward passes merge, the two pointers on t 2 must 
be at the same position, otherwise the run is rejected. 

During this verification, T" also has to output a power of t 2 
(remind that it has already output ti). However the transitions 
of T may not output exactly one t 2 , nor a power of t 2 , but 
may cut t 2 before its end. Therefore T' needs another pointer 
h to know where it is in t 2 . Initially this pointer is at the first 
position of t 2 (h = 1). Suppose that T" simulates T using 
the (forward) transition (p, cr, x,p', +1) and the (backward) 
transition (q',a',y,q,—l). If this step occurs before the end 
of ti, then T outputs t%[h..(h + \y\)] {t 2 is the infinite 
concatenation of t 2 ), and the pointer h is updated to 1 + ((h + 
\y\ - 1) mod \t 2 \). Otherwise,! 1 ' outputs 1%[h..(h+ \x\ + \y\)] 
and h is updated to 1 + ((h + \x\ + \y\ — 1) mod \t 2 \). 

The second case (when T' guesses that t\ ends during the 
backward pass) is similar. T' has to guess exactly the position 
in the output where ti ends. On the first pass it verifies that 
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Fig. 6. Decomposition of the output according to Property TV 



the output is a prefix of t\, and on the simulated backward 
pass, it checks that the output is a suffix of t\ (and outputs 
as many ti as necessary, like before), until the end of t\ is 
guessed to occur. From that moment it enters a verification 
mode on both passes. 

The main property of this construction is that no wrong 
output words are produced by T", due to the verification and 
the way the output words are produced, i.e. for all (u, v) G 
i?(T'), we have (u,v) G R(T). 

Proposition 5. Let T G ZNFT. R(T') = {(u,v) G 
R(T) I (u,v) |=7>i}. 

Lemma 8. Let T G ZNFT. IfT\=V, then T is equivalent to 
the (ZNFT T' . Moreover, the latter is decidable. 

Proof: If T |= V, then by Proposition @] T \= V\. 
Therefore by Proposition T and T" are equivalent. 

We know that i?(T') C i?(T), and since T and T" are both 
functional, they are equivalent iff dom(T) C dom(T'). Both 
domains can be defined by NFAs. Those NFAs simulate the 
three passes in parallel and make sure that those passes define 
a z-motion. Therefore testing the equivalence of T and T' 
amounts to test the equivalence of two NFAs. ■ 

C. From (ZNFT to NFT 

We have seen how to go from a ZNFT to an eZNFT. We 
now briefly sketch how to go from an eZNFT to a (functional) 
NFT. Given an eZNFT T', we define an JNFT T" such that 
T" and T" are equivalent as soon as T" |= V. The ideas are 
very similar to the previous construction therefore we do not 
give all the details here. 

We exhibit a property on the form of output words produced 
by an (ZNFT that verifies V . Intuitively, apart from the 
beginning of the first pass, and the end of the second pass, if 
the two passes produce long enough outputs, then these outputs 
can be decomposed so as to exhibit conjugate primitive roots. 

Definition 4 CP 2 -property). Let T" G (ZNFT with m states, 
and let (u,v) G R(T') where u has length n. Let K = 
2.o.m 3 .|S| where o = max{\v\ \ (p,a,v,q,m) G A}. The 
pair (u, v) satisfies the property Vi, denoted by (u, v) \= V2, 
if for all accepting runs p on u, there exist two positions 
1 < l\ < £2 < n and w, w' , t±, t2, £3 G S* such that: 

outi[l,4]=w |tj| < 3.K, Vi G {1,2,3} 

out 3 [£ 2 ,n+ 1} = w' |outi[4,n]| < 3.K 

outi[£i,n]out 3 [l,4] e hqt 3 |out 3 [l,4]| < 3.K 
This decomposition is depicted in Fig. [6] T' satisfies property 
V2, denoted T \= V% if all (u, v) G R{T') satisfy it. 



The proof of the following proposition uses the same 
structure and techniques as that of Proposition |4] however it 
has much more cases to consider and are therefore the most 
technical proof of this paper. Using a case analysis, we identify 
loops in the runs, and apply Property V to show that the output 
words have the expected form. 

Proposition 6. Let T G (ZNFT. IfT |= V, then T |= V 2 - 

We can now sketch the construction of an fNFT T" which 
recognizes the subrelation of T' defined as {(u,v) G R(T r ) \ 
(u,v) \= 7^2}- Again, the construction is rather similar and 
uses the same techniques to that of T" starting from T. 

The transducer T" simulates, in a single forward pass, 
the three passes of T 1 . Hence it also checks that the run 
of the ZNFT T' it simulates is a z-motion run, which is a 
semantic restriction of accepting runs of ZNFTs. The fNFT 
T" also guesses positions l\ and £2, and uses three modes 
accordingly. It also guesses the words t%, t% and t%, and words 
for out,3[l, £1} and outi [£2, n], which are all of bounded length 
(see Property V2). The output of T" is produced according to 
the mode, using pointers to check the guesses, similarly to T". 

If all the guesses happen to be verified, it outputs the 
correct output word, otherwise the input word is rejected. As 
a consequence, T" recognizes a subrelation of T" and thus 
checking the equivalence of T" and T" amounts to checking 
the equivalence of their domains (as the two transducers are 
functional), which is decidable. Thanks to Proposition [6] we 
obtain the following result: 

Lemma 9. Let T G (ZNFT. IfT' \= V, then T is equivalent 
to the fNFT T". Moreover, the latter property is decidable. 

Proof of Lemma|3j Lemma|7]states that if T is M^T-definable, 
then T \=V. Conversely, ifT\=V, then by Lemma[8] the first 
construction outputs an equivalent (ZNFT T'. By Lemma |6] 
we have T' \= V . By Lemma |9] the second construction 
outputs an equivalent NFT T". Therefore T is A^r-definable 
by T". In order to decide whether T \= V, it suffices to 
construct T', check that T and T" are equivalent, and then 
construct T" and check whether T" and T" are equivalent. 
Both problems are decidable by Lemma |8] and |9] 

V. Discussion 

Complexity The procedure to decide (f2NFT,NFT)- 
definability is non-elementary exponential. This is due to the 
ZNFT-to-NFT construction which outputs an NFT of doubly 
exponential size. Indeed, the first step of this construction 
transforms any ZNFT with n states into an (ZNFT with at 
least |S| 4oTl l E l states, as the (ZNFT has to guess words 
of length 4on 3 |E|, where o is the maximal output word of 
a transition. The (ZNFT-to-NFT construction also outputs 
an exponentially bigger transducer. Therefore the squeeze 
operation outputs a transducer which is doubly exponentially 
larger. Since this operation has to be iterated N 2 times in the 
worst case, where N is the number of states of the initial 
f2NFT, this leads to a non-elementary procedure. 



On the other hand, the best lower bound we have for this 
problem is PSpace (a simple proof that reduces the emptiness 
problem of the intersection of n DFAs is given in Appendix). 

This leaves a huge complexity gap which we think can be 
reduced by an improved (elementary) upper bound. Indeed, 
given an initial f2NFT T with n states, the length of the 
words that have to be guessed each time the ZNFT-to-NFT 
construction is applied is likely to be only exponential in n, 
for the following reason. Consider a loop of an accepting 
run of T. The difference can be bounded by n\, otherwise 
there is a crossing sequence that is repeated inside this loop 
and we can therefore find a smaller loop. All the output words 
produced on this loop by the respective passes of the run 
must be conjugates if T is A/Fr-definable, and therefore in 
the ZNFT-to-NFT construction, it suffices to guess primitive 
roots of length at most n\. 

Succinctness It is already known that 2DFAs are exponentially 
more succinct than NFAs J25). Therefore this result carries 
over to transducers already for transducers defining identity 
relations on some particular domains. However we show here 
a stronger result: the succinctness of 2NFTs also comes from 
the transduction part and not only from the domain part. We 
can indeed exhibit a family of A/Fr-definable transductions 
(R n ) n that can be defined by 2DFTs that are exponentially 
more succinct than their smallest equivalent NFT, and such 
that the family of languages (dom(i?„))„ does not show an 
exponential blow up between 2DFAs and NFAs. 

For all n > 0, we define R n whose domain is the set of 
words #u# for all u e {a, b}* of length n, and the transduc- 
tion is the mirror transduction, i.e. i?„(#u#) = 

Clearly, R n is definable by a 2DFT with 0(n) states that 
counts up to n the length of the input word by a forward pass, 
and then mirrors it by a backward pass. It is also definable by 
an NFT with 0(2 n ) states: the NFT guesses a word u of 
length n (so it requires 0(2") states), outputs its reverse, and 
then verifies that the guess was correct. It is easy to prove that 
any NFT defining R n needs at least 2" states by a pumping 
argument. On the other hand, the domain of R n can be defined 
by a DFA with 0{n) states that counts the length of the input 
word up to n. Note that the alphabet does not depend on n. 

Further Questions We have shown that (j2NFT,NFT)- 
definability is decidable, however with a non-elementary pro- 
cedure. We would like to characterize precisely the complexity 
of this problem. Our procedure works for functional 2NFTs. 
They are known to be equivalent to 2DFTs. Therefore we 
could have done our proof directly for 2DFTs. However (func- 
tional) non-determinism was added with no cost in the proof 
so we rather did it in this more general setting. Nevertheless, it 
is still unknown whether our result extends to relations instead 
of functions. 

Our proof is an adaptation of the proof of Rabin and 
Scott Q to transducers. Alternative constructions based on 
the proofs of Shepherdson (01 or Vardi [16] would also be 
of interest. Moreover, we could have taken models equivalent 
to f2NFTs as input, like streaming string transducers ifKfl or 



MSO transformations O, (9). 

We would like to investigate transducers on infinite strings, 
as, in some contexts, streams may not terminate. We also 
plan to consider trees instead of strings, following our initial 
motivation from XML applications. 
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Appendix A 
Complements to SectionHlTI 

A. Iterative z-motions removal (proof of Lemma]2$ 

We define the crossing number of the position x G 
{1, . . . , to} as the number |{i | a, = x}\. Hence the sequence 
s is X-crossing if all its positions x G {1,2..., m} have a 
crossing number less or equal than K. 

We say that two z-motions z\ — (rj,r,+i), z 2 = (rj,rj+t) 
are consecutive, resp. positionally disjoint, if j = i + 2, resp. 
max(o n ,a n+1 ) < min(a rj , a Tj+1 ) (or max(o rj ,a rj+1 ) < 
min(a ri , a ri+1 )). Moreover we say that z\ and z 2 are disjoint 
if they are not consecutive or if they are positionally disjoint. 
Equivalently, the z-motions z\ — a^^ki+ii ■ ■ ■ )<Zfc 2 an d 
Z2 — afc 3 , o-k 3 +i, ■ ■ ■ j afe 4 are disjoint if and only if fc 2 < &3 
or /c4 < fci. 

Lemma 10. 7f s ;s K -crossing, then for all z\ , z 2 , . . . , z t 

consecutive z-motions, for all i < t — K, Zi and zi+k ore 
positionally disjoint. 

Proof: Let j G {1, ...,/} such that z\ = (rj, rj+%), z 2 — 
(rj +2 ,rj +3 ),... and, wlog, assume a r . < a r +1 . As a 
consequence of the definition of z-motions, consecutive z- 
motions form a stair, that is, we have a r . , <n r .,,,.,„ and 
a rj+2i+1 < a ri+2(i+1)+1 . If Zi and z i+ ^ are not positionally 
disjoint, all z k for i < k < i + K share the leftmost position 
of Zi, i.e. they share a rj+ , (i _ ir Therefore s is not if-crossing. 

■ 

We say that a position x is /n between the positions y and 
z whenever y < x < z or z < x < y. We say that the pair of 
reversals (or a z-motion) (r, s) is nested into the pair (/, s') 
if a r and a s are in between a r / and a s '. 

Lemma 11. Let (ri,rj), with i < j, be a pair of reversals 
and z — (r, r') be a z-motion. If a r is in between a Ti and a rj , 
and r G {ri+i, Tj+i}, or if ' a r > is in between a Ti and a Tj , and 
r' G {ri—x, rj—i}, then z is nested in (ri,rj). 

Proof: Suppose that z = (ri-i. r,_i) (the other cases are 
proved similarly). Wlog assume a r < a n , so by hypothesis 
we have a r , < a ri _ 1 < a n . Then, as a consequence of basic 
properties of reversals, a n _ 1 < a n _ 2 (because a n _ 1 < a n ). 
Moreover as (fj_2,rj-i) is a z-motion we have a n _ 2 < a n . 
Therefore we have the inequalities: a rj < a n _ 1 < a ri _ 2 < 
a n , which means that z is nested in (ri,rj). ■ 

The one-step sequence s' is obtained from s = ai, . . . , a n 
by removing the z-motion z = , • • • , dk 2 also defined by its 
reversals as z = (n,n+i), if s' = ax, ... ,a n ,ak 2 +i, ■ ■ ■ ,a„. 
Note that the sequence s' is a one-step sequence because s is 
one and because a Ti — a^ 2 . The sequence s' has exactly 2 less 
reversals than s and each reversal of s not in z corresponds to 
one of the reversals of s', each z-motion z' = (r 3 , r J+1 ) of s 
such that rj,rj + i ^ {r{,r i+ i} is also a z-motion in s' (up to 
an index shift). Note also that positionally disjoint z-motions 
in s are still positionally disjoint in s'. 

We define the function squeeze(s) as the function that asso- 
ciates to a one-step sequence s the set of one-step sequences 



that can be obtained from s by removing some pairwise 
disjoint z-motions of s. 

We say that a set Z of z-motions of s is consistent if no two 
z-motions of Z share a reversal, that is, if (r, r'), (s, s') G Z, 
then r, r' ^ s and r, r' ^ s'. The consistent set Z is maximal 
if it is not strictly contained into any other consistent set of 
z-motions of s. 

Lemma 12. Let s be a K-crossing one-step sequence. If Z 
is a consistent set of z-motions of s then there is some s' G 
squeeze^ (s) that contains no z-motion of Z. 

Proof: Let Z = {zi, Z2, ■ ■ ■}, where the z-motion are 
ordered, i.e., if Zi = (r,r + 1) and Zj+i = (s,s + 1) then 
r + 1 < s. We define sq = s, and, for all < i < K, Si is 
obtained from Sj_i by removing Zi, Zi + K, Zi + 2K ■ ■ ■ ■ Clearly 
Sj+X £ squeeze l+1 (s) if Zj+^K and Zj+^'K are disjoint in s^. 
We consider two cases. Either Zi + jK and Zi + j>K belong to 
a sequence of z-motions in Z that are consecutive in s, in 
that case we can apply Lemma [10] which shows that they are 
disjoint. Otherwise, z^k an d Zi+i'K do not belong to such 
a sequence of z-motions in Z, that is, there exists a reversal 
r that does not appear in any z-motion of Z and which is 
between the second reversal of Zi+jK and the first reversal of 
Zi+j'K, but then they cannot be consecutive in s' (they are 
also separated by r in s'), so, by definition, they are disjoint. 

■ 

Proof of Lemma [2} Let s% = s, and for all i > 1, let 
Zi be a maximal consistent set of z-motions of Sj, and s^+i 
be the one-step sequence obtained from Si by removing Zi. 
We show that each z-motion z in Zi has one of its positions 
whose crossing in s is at least i + i' where i' is the crossing 
of the corresponding (some shift might be applied) position in 
Sj+i. This trivially holds for si = s, so suppose it holds for i 
and let us show it also holds for i + l. Let r[ , r' 2 , . . . , r[, be 
the reversals of Sj, let z = (?"^,r^/) be a z-motion in 
(recall that we abuse notation and refer to the reversals of Sj + i 
using the reversals of Sj though there is a shift of index for 
some of them). As Zi is maximal, z is not a z-motion in s^, 
so there is a z-motion z' = (r' k ,r' k+1 ) G such that one of 
the following holds: 

• k = i' + 1 or k = j' + 1 and a r > is in between a r > and 

a r'_, 

• k = i' — 2 and <v is in between a,.' and a r > 

Intuitively the above property states that one of the z-motions, 
z', in Zi must prevent z to be a z-motion in Sj, that is, z' is 
somehow 'in' z. In each of these two cases we can apply 
Lemma [TT] which states that z' is nested in z. By induction 
hypothesis, one of the position of z' has a crossing number 
in s of at least i + i', where i' is the crossing number of the 
corresponding position in Sj+i. As Si+2 is obtained from s<+i 
after removing z, we have i' > 1 + i" where i" is the crossing 
number of the corresponding position in So we have 

proved that the crossing number of this position is at least 

To conclude, as s is X-crossing, all positions are at most 



if-crossing, therefore the property we just proved implies that 
Si for i > K has no z-motion, that is Sj = 1,2, ... ,m. 
By Lemma [12] K applications of squeeze are sufficient to 
remove a consistent set of z-motions, therefore 1, 2, . . . , m is 
in squeeze^ (s). ■ 

B. Proof of Proposition \2\ 

A crossing sequence s is repetition-free if each state occurs 
at most once in s. If Q is the set of states of A, we denote by 
CS(Q) the set of repetition-free crossing sequences of A. 

Based on Shepherdson's construction, it is possible to 
construct a one-way automaton whose states are sequences 
of states, such that any run p of A maps to the sequence 
of crossing sequences of p, and conversely any sequence of 
crossing sequences of this automaton maps to a run of A. 
This automaton may have infinitely many states, but it is 
well-known it is sufficient to consider repetition-free crossing 
sequences of states only l24l . 

Lemma 13 ( ll24l ). For all 2NFAs A with set of states Q, it 
is possible to construct an equivalent NFA CS(A) whose set 
of states is CS{Q), and such that for all accepting runs p' of 
CS{A) on u, there exists an accepting run p of A on u such 
that CS{p) = p'. 

Lemma 14. Let A be a 2NFA with set of states Q, and 
9i,92 E Q- Let M qijq2 be the language of words U2 such 
that there exists a word u G L(A), an accepting run p of A 
on u such that p — pi(q%, ii)/>2(92, *2)P3 an d U2 — u\i\..i2\. 
Then M 9li92 is regular. 

Proof: Given two sequences of states s\ and s 2 , the 
language Acc Sl:S2 is defined as the set of words u 2 G S* 
such that there exist a word u G £*, two positions i\ < %2 
such that U2 = u[i\..i2\, and an accepting run p on u 
such that CS(p,ii) = si and CS(p,t2) — S2- In other 
words, S2 is accessible from si by U2- It is easy to show 
that for all s\, S2, there exists repetition-free sequences s[, s' 2 
such that Acc SliS2 = Acc^ ^, Therefore one can consider 
repetition-free sequences only. We have seen (Lemma \l3[ that 
one can construct an NFA whose states are the repetition- 
free crossing sequences of the runs of T. An easy reacha- 
bility analysis of this NFA allows one to construct an NFA 
A qi> q 2 whose states are repetition-free crossing sequences of 
T and such that M quq2 = U{^ cc si,s 2 I 9i G s i,92 G 
S2, si, S2 are repetition-free}. ■ 
Proof: The transduction £7^(91,92) is a function, other- 
wise T would not be functional. 

We define an intermediate ZNFT Z! r (qi,q 2 ) that mimics 
T but starts initially in the state q\ and whenever it reaches 
the state q2, it non-deterministic ally decides to go to a 
fresh accepting state q'^. Formally, Z' T (qi 1 q2) = (Q U 
{<?/}, <?!,{<?/}, A') where A' = A U {{q 2 , a, e, q' +1) | a 6 
£}. Clearly, to any accepting run of Z' T {qi,q2) on a word 
U2 G S* corresponds a z-motion run of T on U2 of the form 
p' 2 = (qi,l)p 2 (q2,\ u 2\) and conversely. However Z' T (qi 1 q 2 ) 
is too permissive as it does not check that p' 2 can be embedded 



into a global accepting run of T. We now show how to restrict 
the domain of Z' T (qi,q 2 ) to take this further constraint into 
account. 

By a simple adaptation of Shepherdson's construction (see 
Lemma [Pill, the language M qi>qs of words U2 such that there 
exists u G dom(T) and an accepting run p of T on u such 
that p — pi(qi,h)p2(q2,i2)p3 and U2 — ufii.-ia], can be 
defined by an NFA A qi . q2 .The transducer Zr(qi, 92) is finally 
defined as Z' T {q\,q2) where during the third and last pass, it 
also checks that the input word is in M qim by running A qijQ2 
in parallel via a product construction. 

Let us briefly explain why this construction is correct. 
Suppose that (u 2 ,v) G Z T {q\,q2)- We have u 2 G M quq2 , 
therefore there exist u G S* and two positions i\ < 12 such 
that 112 = u[ii..i2], and an accepting run p of T of the 
form /9i(qi,ii)p 2 (<72, «2)p3- The subrun ii)p 2 (q2, 12) is 
not necessarily a z-motion, and it does not necessarily outputs 
v. However since (v,2,v) G Zt(qi,Qs), we also have that 
{u 2 ,v) G Z' T (qi,q2), and therefore there exists a z-motion 
run p 1 of T from qy to q 2 on u 2 - One can therefore substitute 
(<7i, ii)p2(l2, ^2) by p' in p (modulo a shift of the positions 
occurring in p'), and one gets a new run 7 = p\p' p2- The 
run 7 is still an accepting run of T on u, and therefore 
(u 2 ,v) G Lt(<7ij<72)- The converse is easy by applying the 
definitions. ■ 

C. Proof of Lemma [4] 

Proof: As in the proof of Proposition [2] we consider only 
the transductions Cr(qi, 92), the other case being solved by 
using the equality TZr(qi,q2) = ^t('? 1 '9 2 )- Let Z G ZNFT 
that defines £t (91,92) for some 91,92 and suppose that T is 
M^T-definable. By Lemma[3]we have to show that Z |= V. Let 
u G dom(Z) of length n and p = (pi,l) • •• (p n ,n){q n -\,n — 
1) . . . (91, l)(r2,2) . . . (r n+ i,n + 1) an accepting run of Z on 
u. Let (ii,ii) and (^2,^2) be two loops of p such that ji < i 2 . 
These loops induce a decomposition of the input word u as 
u = U1U2U3U4U5 with U2 = u[i±..ji — 1] and M4 = u[«2--j2 — 
!]• 

As (ii,ji) and {12,32) are loops in p, for any k\,k,2 > 0, 
we have UiU 2 1 UsU^ 2 'U5 G dom(Z). By definition of the 
transduction £7^(91,92), any word in dom(Z) can be ex- 
tended into a word in dom(T). Thus, for any ki,k 2 > 0, 
there exists OikxMi a 'k k ^ ^* suc b that u(k 1 ,k 2 ) = 

ctk 1 M u l u 2 lu i u i u ^ a 'ki,k 2 G dom (^)- 

In addition, by assumption, T is A^Fr-definable and thus 
there exists an NFT T' such that T = T'. We consider such 
an NFT T', and denote by TV its number of states. Let us 
consider ki = k 2 = N + 1. There exists an accepting run 
p' of T" on the word u(k\,k2). Consider the state in which 
is this run just before the z-th iteration of the word u 2 , for 
i G {1, . . . , fci}. As fci = N + 1, two of these states must be 
equal. A similar reasoning can be done for the powers of the 
word U4. As a consequence, there exist constants Ci,c^ > 
with i G {1, 2, 3} such that c 2 , c' 2 > and the word u(k±, k 2 ) 



can be decomposed as follows: 

u(k 1 ,k 2 ) = ak 1 ,k 2 u i u 2 lu T u 2 3u 3'>M 1 ul 2 ul 3 U5a' klk2 

with the property that p' contains two loops on the input 
subwords u 2 2 and 

To conclude, we let fa (resp. fa, fa, fa, fa) be the 
output produced by p' on the input subword u\v^ (resp. u 2 2 , 
u^u^u^, u c ^u§), and /(fcijfe) (resp. g(ki,k 2 )) be the 
output produced by p 1 on the input subword a(fci,fc2) (resp. 

■ 

D. Definition of squeeze(T) 

We let L T (gi,g 2 ) = {Q qi m , g^ 1 ' q2 , F qi < q2 , A 91 < q2 ) and 
i?T (gi , = (-P 91 < 92 , Pi 1 ^ , G 91 < 92 , r 91 > 92 ) for all qi,q 2 &Q. 

We let squeeze(T) = (Q',Q^,F',A') and show formally 
how to construct it. For more convenience here we assume 
that squeeze(T) can have a set of initial states. It will be 
easy to transform it into a (usual) 2NFT. We let Q' — Q W 
y{Q».wwP»-» | 91, g 2 G Q}, Q{, - {g }U{g 91 < 92 | gi ,g 2 £ 
Q} U {p q uq2 | gi,g 2 G Q], F' = F and A' is the least set 
satisfying for all gi, q% G Q: 

• AyW M!£ «A«*CA'; 

• V(p,a,v ) qi,m)eA, (p,a,v,q qi ' q2 ,m)eA'; 

. Vg G F qi ' q2 , V(p, a, u, g, +1) G A 91 ' 92 , 
V(q 2 ,a,v',q 3 ,m) G A, (p,a,vv',q 3 ,m) G A' 

. Vg G F quq2 , V(p, o, u, g, +1) G A 91 ' 92 , 

W(q 2 ,a,v',q 3 ,m) G A, for all g 4 G Q, 

. Vg G F 91 ' 92 , V(p, o, u, g, +1) G A 91 ' 92 , 
Vg 3 G Q, V(gjf ' 93 ,a,t/,g',m) G A 92 ' 93 U T 92 ' 93 , 
(p, a, w', g', to) G A' 

and similarly: 

• l+J c r,r 91 - 92 C A'; 

. V(p, a, v, gi,m)GA, (p, a, v,Pc, 1:92 , m)eA'; 

. Vg G G 91,92 , V(p,a,w,g,-1) G T 91 - 92 , 

V(g 2 ,a,v',g 3 ,m) G A, (p, a, W, g 3 , m) G A' 
. Vg G G 91,92 , V(p,a,w,g,-1) G r 91,92 , 

V(g 2 ,a,w',g 3 ,m) G A, for all g 4 G Q, 

(p,a,W,g 93 ' 94 ,m) G A' 
. Vg G G 91 ' 92 , V(p, a, v, g, -1) G r 91,92 , 

Vg 3 G Q, V(g 92,93 ,a,w',g',m) G A 9 ' 2 ' 93 U T 92 ' 93 , 

(p, a, w', g', to) g A' 

£. Proof of Proposition 

Proof: Since squeeze(T) contains T as a subtrans- 
ducer, we have R(T) C i?(squeeze(T)). Let us show that 
i?(squeeze(T)) C R(T). Let G i?(squeeze(T)). There- 

fore there exists an accepting run p of squeeze(T) on u that 
outputs v. We are going to construct an accepting run of T 
on u that outputs v, this can be done by induction on the 
number of times p goes in Z-mode. If it never does so, p is 
accepting run of T and we are done. Otherwise suppose that 
p goes at least once in Z-mode for some gi,g 2 G Q. Note 
that the set A' consists of A, the sets A M and T p ' 9 for all 



p, g G Q, and new transitions of three kinds (of the form 
(p,a,vv',q3,m),(p,a,vv',q q l a q ' l ,m) and (p,a,vv' ,q' ,m) in 
the definition). Consider the first use of such a transition t in 
p. One can decompose p as p\p 2 tp 3 where p\ is in J-mode, 
p 2 in Z-mode, and assume that p 2 t is a forward run on a factor 
u 2 of u (the case of a backward run is symmetric). 

Let us inspect the case where t = (p, a, vv', g 3 , m). The 
other two cases (depending on the form of t) are proved 
similarly. Suppose that p G Q qi,q2 . Then it means that 
{u 2 ,v) G CT{qi,q 2 ), and therefore one can easily reconstruct 
a z-motion run p' 2 of T on u 2 from gi to g 2 that outputs v. 
Then by definition of A', we know that there exists a transition 
from g 2 to g 3 that produces v'. By induction we can also 
transform p 3 into a run p' 3 of T that ends in an accepting state 
and outputs the same word. Therefore p'ip' 2 (q 2 , a, v', g 3 , m)p' 3 
is an accepting run of T on u that outputs the same word as 
p. Therefore (u,v) G R(T). ■ 

Appendix B 
Complements to SectionHVI 

A. Technical results 

Lemma 15. Let £,r, A be three finite alphabets, \P a mor- 
phism from T to E* and $ a morphism from T to A. Let 
M = max{\^(~f)\ | 7 G T}. For all words u G V*, 
if \^{u)\ > (|A| + 1).M, then there exist two positions 
1 < ki < k 2 < \u\ such thafy 

1) |*(u[l..(fci-l)])| < (|A| + 1).M 

2) 1< |*(«[fci..(fc 2 -l)])| <(|A| + 1).M 

3) $(«[*i]) = $(«[**]). 

Proof: Let L(u) be the set of loops that are strictly 

contained in u, i.e. L(u) — | 1 < i < j < \u\, (i 7^ 

1) V (j ^ \u\), ®{u\i}) = $(u[j])}. We first show the 
following by induction on 




|*(«)| > (|A| + l).M 
3{iJ) G L(u), 1 < \V(u[i..j})\ < (|A| + 1).M 



If \u\ = (resp. |u| = 1) then = (resp. |*(u) < 

M) and therefore the above implication is obviously satisfied. 
Otherwise suppose that |u| > and > (|A| + l).M. 

Therefore we have \u\ > |A| + 1 > 2, and |w[2..|u|]| > |A|, and 
so by the pigeon-hole principle there exist two positions i < j 
in u[2..|u|] such that = <5>(u\j]), so that L(u) ^ 0. 

Suppose that for all G L(u), &(u[i..(j — 1)]) = e. If we 
remove maximally from u all the factors of u from position i to 
position (j — 1) for all G L(u), one obtains a word v such 
that L(v) = and \^{v)\ = \$>(u)\ > (|A| + l).Af. Moreover 
M < |A| + 1 since L(v) = 0, but this contradicts > 
(|A| + 1).M by definition of M. Since L(u) ^ 0, we get the 
existence of (io,jo) G L(u) such that ^>(u[i ..(j — 1)]) ^ e. 
If |*(u[i ..(jo - 1)])| < (|A| + l).M we are done. Otherwise, 
since |w[?o--Oo — 1)]| < \ U V by induction hypothesis we get 
the existence of a pair G L(u[io..(jo — 1)]) such that 

7 In this Lemma, if fci = 1 then we let u[l..(ki — 1)] = e 



1 < |*(«[t ..(7o - - 1)])| < (|A| + 1).M, from 

which we can conclude by taking i = i* + io — 1 and j = 
j* +i — 1 (note that (i, j) e L(u)). 

This shows items (2) and (3) of the Lemma. Again by 
induction on \u\ and by using (i), we prove the lemma. 
If |u| = or |u| = 1, then the implication obviously 
holds. Otherwise assume that |*(u)| > (|A| + l).M. By 
(i) there exists (k\,k2) G L(u) that satisfies (2) and (3). If 
\^{u[l..(kx - 1)])| < (|A| + l).M we are done, otherwise by 
induction hypothesis, there exists (k[, k' 2 ) G L(u[l..(ki — 1)]) 
which satisfies (1), (2) and (3), from which we can conclude. 

■ 

Lemma 16. Let T e ZNFT with m states. Let o be the 
maximal length of an output word in a transition of T and 
K = 2.o.m 3 .|E|. Let p be a run on a word u of length 
n. We write p as the sequence (pi, 1) . . . (p n , n)(q n -i, n — 
1) . . . (qi, l)(r 2 , 2) . . . (r„+i,n + 1) and let q n = p n and 
r% = q\. Let 1 < k < £ < n such that out2 [fc, ^] > K. 
There exists a loop in p such that k < i < j < I and 

1) |out 2 [M]| < K 

2) 1 < |out 2 [*,j]| < K. 

Proof: We show this result by using Lemma [15] 

We consider the alphabet A 3 x E, where A denotes the 
set of transitions of T. Given a triple of transitions 8 = 
((se, ae, ii£, s' e )i<£<3), and a letter a G E, we define the map- 
pings and <£> as ^(0, a) = U2 and $(6>, a) = (si, S2, S3, a). 
Then, we associate to the run p, considered between positions 
k and I, a word over this alphabet of length t—k, indexed from 
k to I — 1, and defined as r\ = {a m )k< m <t-i, where <j m is 
composed of the three transitions used respectively to go from 
configuration (p m ,m) to configuration (p m+ i,m + 1), from 
configuration (q m+ i,m + 1) to configuration (q m ,m), and 
from configuration (r m ,m) to configuration (r m+ i,m + 1), 
and of the letter u[m]. 

Using these definitions, we have = out2 and, 

for any k < m < £ - 1, $(cr fc ) = (p k , q k , r k , u[k}). Then it 
suffices to apply Lemma [TBI to get the result. ■ 

Lemma 17. Let x,y,z,t G E* such that x =/= e and y =/= e. 
Suppose that for all i > 0, x l yz l is a prefix oft w . Then there 
exists 01,02 G E* rac/i that x G (0102)*, z G (0201)* ant/ 
xj/z G ai(a 2 ai)*- 

Proof: By Lemma[JJ/i(x) ~ and /i(z) ~ there- 
fore /i(x) ~ m(^), i.e. there exists 01,02 with x G (0102)* 
and z G (0:20:1)*. Moreover as x* is a prefix of t w for all 
i > 0, clearly jti(t) = = 0102- 

Now let xyz — (0102)^0 a prefix of (0102)" and let us 
show that a = ct\. So suppose a/3 = ai (the other case when 
Oi/3 = a is proved similarly). Therefore z — (0201)" = 
(a 2 af3) a but also xyz = (oi02) fc o implies that z = (/3o2o) a . 
So f3a 2 a = 020/3 which means 0102 is not primitive if (3 ^ e. 



B. Proof of Proposition [4] 

Proof: • If |out2[l,n — 1]| < K, then clearly, it suffices 
to take £ = n, t\ = out 2 [n — 1, n], t% = e, <3 = out2[0, n — 1], 
w = outi[l, n] and w' — out 3 [l, n + 1]. 
• Otherwise, |out2[l,n — 1] > K. Therefore u is of length 
2.771 3 . 1 S I at least and there exists necessarily a (non-empty) 
loop in p. We can always choose this loop such that 

|out 2 [l,i]| < K and 1 < |out 2 [i,j]| < K (see Lemma [Toll. 

The loop partitions the input and output words into factors 
that are depicted in Fig. [5j (only the two first passes are de- 
picted). Formally, let u — U1U2U3 such that 1*2 = u[i..(j— 1)]. 
Let xq = outi[l,i], v% = outi[i, j], Xi = outi[j, ri]out 2 [j, n], 
v 2 = out 2 [i,j], x 2 = outi[l,i], x 3 = out 3 [l,i], v 3 = 
outa[i,j] and X4 = out3 [7,71+1]. In particular, we have \x2\ < 
K, 1 < |«2| < K and X{)V\X\V2X2Xj,v^x^ G T(u). Since 
is a loop we also get x$v\ X1V2X2X3V3XA G T^xu^u^) for 
all k > 0. 

We then distinguish two cases: 

1) If v\ ^ e. We can apply Property V by taking the second 
loop empty. We get that for all k > 

f{k)x V k 1 C+C ' ' Xl V k 2 C+C ' 'x 2 X 3 V* C+C ' 'x 49 (k) = fcfifo 

where /,g:N^S*,ce N> , c' G N, and /3i, /3 2 , £3 G 
E*. Since the above equality holds for all k > 0, we 
can apply Lemma [TJ and we get p,(v\) ~ //(/?2) and 
^(1^2) ~ m( u 2), and therefore ~ p-fa). So there 

exist x, y G E* such that i>i 6 (xy)* and u 2 G (yx)*. 
By Lemma [17] we obtain that vixii^ G x(yx)*. Then 
it suffices to take £ — i, w = xq, t\ — x, £ 2 = 2/^ and 
^3 = X2 to conclude the proof. 

2) Otherwise, we have v\ = e. We decompose x\ as xi = 
?/i2/2 where yi = outi[j,n] and 2/2 = out 2 [j, n}. 

We again distinguish two cases: 
a) We first consider the case when |yi| = 
|outi[j, n]| > K. In this case, we can as before 
decompose the input word u[j..n] to identify a 
loop. More precisely, there exists a loop (r, s) in 
p such that r > j, |outx[7, r]| < AT and 1 < 
|outi[r, s] < A. This loop gives a decomposition 
of U3 as U4M5M6 . We will then apply Property V to 
the two loops (i, j) and (r, s). The loop (r, s) gives 
a decomposition of y\ as zqW\Z\, y 2 as Z2W2Z3 and 
X4 as Z4W3Z5. By Property T 5 , there exist words 
i G {1, . . . , 5}, and ci, c' x , c 2 , c' 2 , /, 5 such that, for 
all ki,k 2 > 0, 

X3^ 1 z4^ 2 ^5(fci,fc 2 ) = /3i/3 2 fel /3 3 /3 4 fe2 /?5 

where r/^ = fcjCi+c^, i G {1, 2}. Recall that wi ^ e 
and W2 / £■ As a consequence, we can, using 
sufficiently large values of k\ and k 2 and applying 
LemmafTJ prove that p(w\) ~ /i(/?4), that ^(^2) ~ 
^{Pij, and thus deduce that p{w\) ~ p,(v 2 )- 
Therefore there exist x,?/ such that «2 G (2/^)* 
and u>i G (xj/)* from which we deduce that 



W1Z1Z2W2Z3V2 G x(yx)*. Recall that by the choice 
of the loop (r, s) we have \zq\ < K. We can thus 
define £ = i, w — xq, t\ — zqx, t 2 — yx and 
£3 = %2 to obtain the result, 
b) The last case is when \yi\ = outi[j, n}\ < K. 
We consider the length of y^ — out2[j, n]. First 
observe that if we have \y2 < K then we are done. 
Indeed, we can define £ = j, t\ = y\, t% = yi and 
£3 = V2%i- It is routine to verify that the conditions 
of Property V\ are fulfilled. 

We thus suppose that \y 2 \ > K. In this case, we can 
as before identify a loop (r, s) in the run p such that 
r > j, out 2 [s, n] < K and 1 < out 2 [r, s] < K. We 
do not give the details, but one can apply Property 
V to the two loops (i, j) and (r, s) and use the fact 
that out2[i, j] ^ e and out2[r, s] ^ e to prove that 
fj, (out2[i, j]) ~ p(out 2 [r, s]). Then, there exist x, y 
such that out2[r, s] G {xy)* and out2[i,j] G {yx)* 
from which we deduce that out 2 [i, s] G (xy)*x. 
Finally, we let £ = i, w — xq, t\ — 
outi [i, n]out2[s, n], t 2 — xy and i 3 = XX2 to 
obtain the result. 

■ 

C. From eZNFT to NFT 

We state the following Lemma whose proof is similar to 
that of Lemma [T6l 

Lemma 18. Let T e ZNFT with m states. Let o the 
maximal length of an output word in a transition of T and 
K = 2.o.(m 3 .|S|). Let p be a run on a word u of length 
n. We write p as the sequence (pi, 1) . . . (p„, n)(q n -i, n — 
1) . . . (qi, l)(r2,2) . . . (r n +i,n + 1) and let q n = p n and 
r\ = q\. Let two indices 1 < i < j < n. Then, we have: 

1) if out3 [z, j] I > K, there exists a loop (k\, ^2) in p with 
i < ki < k2 < j such that 

a) |out3[i, ki] I < K 

b) 1 < |out 3 [fci,fc 2 ]| < K 

2) if out3 [z, j] I > K, there exists a loop {k\, fe) in p with 
i <k\ <k 2 < j such that 

a) |out 3 [fc 2 ,j]| <K 

b) 1 < |out 3 [fci,fc 2 ]| < K 

3) ;/ |outi[i, j]\ > K, there exists a loop (k±, ^2) in p with 
i < ki < k,2 < j such that 

a) |outi[i,fci]| < K 

b) 1 < |outi[Ai,A 2 ]| < K 

4) if |outi[i, j]\ > K, there exists a loop (ki, fe) in p with 
i < k\ < k,2 < j such that 

a) |outi[fc 2 ,j]| < K 

b) 1 < |outi[Ai,*!2]| < K 

Proof of Proposition |6] 

Proof: We let T = (Q,q ,F,A) and K = 2.o.m 3 .|E|. 
Recall that as T' G eZNFT, we have out2[l,n] = e. 

Let us define the position £ as the largest positive integer 
less than or equal to n such that outi[£, n] = e. 



< k < K 
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1 k-L = ii k 2 l = l 2 n 



Fig. 7. Decomposition of the output for case 1. 1) 

We first observe that if |out 3 [l,£]| < K, then we are done, 
by considering £\ = £2 = I. Indeed, we then consider w = 
outi[l, £], w' — out 3 [£, n + 1], t\ — out 3 [l, I], and t 2 = t 3 = 
e. 

Thus, we now suppose that we have |out 3 [l,^]| > K. In 
this case, we can apply Lemma [T8l case 1): there exists a loop 
(^1,^2) such that |out 3 [l, k\] < K and 1 < |out 3 [fci, ^2] < 
K. 

We again distinguish two cases: 
Case I: out 3 [A:2, £) | < K. For this case, we again distinguish 
three cases, depending on the value of outi[fci, £2] and on the 
length of |outi[fc 2 ,^| < K: 

1) if we have outi[fci,fc2] 7^ e. We will prove that the 
output word outi [fci, n]out 3 [l, £} has the expected form 
(tit^ts). Therefore we use the P-property on the loop 



(k\,k2) with an 


additional empty loop. We define 


n> 


= outi[l, ki] 


Xi 


= outi [A;i, k 2 ] 


y 


= outi [k 2 , fi]out 3 [l, k{\ 




= out 3 [fci, k 2 ] 


Z 


= out 3 [fc 2 ,^] 


w' 


= out 3 [^,n+ 1] 



Property V entails that there exist /3i, /3 2) ^3, /, 9, c, c' 
such that, for all k > 0, 

f(k)wx k ; c+c 'yx k 2 c+c 'zw'g(k) = fotffo 

As we have x\ ^ e, and X2 ^ e, this entails, thanks 
to the fundamental lemma (Lemma Q]), that p(x\) ~ 
jti(a; 2 ). Let t 2 be p(x\). We can write t 2 = z\z 2 and 
n{x2) = Z2Z\. As a consequence, we obtain that x\yx2 
is of the form t\.Z\ by Lemma [TT] We can thus set 
t\ = e, £3 = Z\.z, £\ = ki and £2 = £■ It is routine to 
verify that words w, w', t±, t%, t% verify the conditions of 
V 2 -property. 

This case is depicted on Figure [7] 

2) if we have outi[fci,fc2] = e and outi[fc2,^]| < K. 
We will show that the result is easy. Indeed, consider 
£\ = ki, £2 — £, ti = outi[fci,n]out 3 [l, k\], t 2 = 
out 3 [/ci, fc 2 ], and t 3 = out 3 [/c 2 ,^]- It is routine to verify 
that all the requirements of P2 -property are met. 

This case is depicted on Figure [8] 

3) last, if we have outi[fci,fc2] = e and |outi[fc2,^]| > K. 
In this case, we will have to identify a loop in this part 
([k 2 , £]) of the input word, to prove the expected form of 
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Fig. 8. Decomposition of the output for case 1.2) 
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Fig. 9. Decomposition of the output for case 1.3) 



Fig. 10. Decomposition of the output, case II. 1). a) 
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1 ki = £1 k 2 = £2 £ 

Fig. 11. Decomposition of the output for case II. l).b) 



the output words. Formally, we apply Lemma [T8l as we 
did before, except that we are interested in the output 
produced in the first pass of the ZNFT, and not in that 
produced in the third pass. We thus apply case 3) of 
Lemma [T8l We can thus exhibit a loop (jx,j 2 ) with 
k 2 < jx < 32 < — 1 suc h that |outi[fc2, ji]\ < K and 
1 < |outi[ji,ia]| < K. 

We are now ready to prove that the output word 
outi[/ci,ri]out 3 [l,^] has the expected form (tit^ts). To 
this aim, we define: 
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= u[\, ki - 
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= OUtl[fcl,Jl] 


U3 


= u[k 2 ,jx 


-1] 




= outi[ji,j 2 ] 
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= outi[?2,n]out 3 [l,fci 


«5 


= u[j 2) £- 
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= out 3 [fci, fc 2 ] 
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= u\i, n] 




Zl 


= out 3 [fc 2 , jx] 
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= out 3 [£,«+!] 



As (ki, ki) and (ji,j 2 ) are loops, we can apply Property 
V. Using the fundamental lemma, we can deduce that 
/i(xi) ~ li(xi), using a reasoning similar to that of the 
proof of Proposition [4] Thus, we can set t 2 = n(xx), and 
write ti = cticii such that ^(x 2 ) — a 2 ai, from which 
we deduce xiyx 2 G t\oti (Lemma [TTTi. Finally, we let 
z = Z1Z2Z3, t 3 = ctiz, ti — ki and £2 — £■ The reader 
can verify that all the requirements of ^-property are 
met. 

This case is depicted on Figure [9] 
Case II: we have out 3 [fc2,^]| > K. We distinguish three 
cases, according to the length of the word outiffe,^, and 
to the value of outi[fei, k 2 ]: 

1) if we have |outi[fc2,^]| < K, we distinguish two cases: 
a) We first consider the case when outi[fci,fc2] = e. 



In this case, we can simply define £1 = £2 — fci, 
and verify that the conditions of the 7^ -property 
are met. This case is depicted on Figure [10] 
b) The second case is when outi[fci, k 2 ] 7^ £• This 
case is easy as we can show that /i(outi[fci, k2]) ~ 
/i(out 3 [fci, k 2 ]), and deduce the expected form for 
the output words, by setting £2 — k 2 . This case is 
depicted on Figure [TT] 

2) if we have |outi[fc2,^]| > K and outi[ki, k 2 ] 7^ e. As 
outi[fc2,^]| > K, we can apply Lemma [T8~l case 4), to 
identify a loop (ji,j 2 ) such that |outi[j2,^]| < K and 
1 < outi[ji,j2] < K. In this case, we set £1 = ki and 

There are three cases, according to out 3 [71,72] and 
out 3 [fc 2 , jx]: 

a) We first consider the case when out 3 [71,^/2] 7^ 
e. In this case, using Property V, we can 
show that //(outi[A;i, k 2 ]) ~ n(outx\jx,j 2 ]) ~ 
/i(out 3 [fci, fa]) - ^(out 3 [ji,j 2 ])- This allows to 
prove the expected form of the output words. 

b) Second, we suppose that out 3 [?i,j2] = e and that 
out 3 [fc2,ji] < K. In this case, we can use the 
word i 3 to cover the output word out 3 [k 2 ,ji]. Last, 
using a reasoning on word combinatorics, we can 
prove that fx(outi[ki,k 2 ]) - jti(outi[j'i, j 2 ]) ~ 
fi(outs[ki , k 2 ]) and conclude. 

Cases a) and b) are depicted on Figure [T2] 

c) Last, we consider the case out 3 [71,72] = e 
and out 3 [fc2,ji] > iC. By Lemma [18] case 
2), there exists a loop (^1,^2) included in 
the interval [fojji] sucn that out 3 [p2,ji] < 

and 1 < |out 3 [pi,p2]| < We claim 
that the result holds. The only difficult prop- 
erty is the fact the output word has the ex- 
pected form (tit^ts). This can be proven using 
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Fig. 12. Decomposition of the output for case II. 2). a) and b) 
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Fig. 13. Decomposition of the output for case II. 3). a), out3[j'i, j'2] 7^ e 

word combinatorics, by showing, using the Prop- 
erty V, that /i(outi[fci,fc 2 ]) ~ ^(out^j!,^]) ~ 
/u(out 3 [fei, fc 2 ]) - /i(out 3 [pi,p 2 ])- 
3) last, if we have |outi[fc 2 ,£]| > K and outi[fci, k 2 ] = e. 
We first let i\ — k\. As we have |outi[fc 2 ,£]| > K, we 
can apply Lemma [T8l case 3), to identify a loop (ji,j 2 ) 
included in the interval (k 2 ,£) such that |outi[fe 2 , < 
K and 1 < |outi[ji, j 2 ]\ < K. We distinguish two 
cases: 

a) if |outi[j2,<]| < K. We define l 2 = j 2 . We 
consider the value of out3 \j\ , j 2 ] . 

If we have outs\ji,j 2 ] ^ e, then we can 
conclude. Indeed, using word combinatorics, we 
can prove /Lt(outi [71 , j 2 ]) ~ /Lt(out 3 [fei, k 2 ]) ~ 
/i(out3[ji, j 2 ]) and prove that the output word 
outi[fci, n]out3[l, j 2 ] has the expected form. This 
case is depicted on Figure Qj] 
Otherwise, we have out3 [ji , j 2 ] = e. For this case 
we distinguish two cases: 

i) if out3[fc 2 , ji] < K: we can conclude directly. 
Indeed, it is easy to show that /i(outi [ji , j 2 ]) ~ 
/i(out 3 [fci, /c 2 ]). The word outs[k 2 , j 2 ] is not 
necessarily conjugated with the previous words, 
but its length is less than K by hypothesis, thus 
we can use the word i 3 to handle this part of 
the output. This case is depicted on Figure [14] 

ii) if out3[fc 2 , ji] > K: we will apply Lemma [T8l 
case 2), to identify a loop (pi,p 2 ) included in 
the interval (k 2 ,ji) such that |out 3 [p 2 , < 
K and 1 < |out 3 [pi,p 2 ]| < K. Then we can 
prove that /i(outi[j'i, j 2 \) ~ /u(out 3 [fci, k 2 ]) ~ 
/i(out3[pi,p 2 ]) and conclude. 

b) if |outi[j 2 ,^]| > K. We can apply Lemma [T8l 
case 4), to identify a loop (pi,p 2 ) included in the 
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Fig. 14. Decomposition of the output for case II.3).a).i) 

interval {j 2 ,£) such that |outi[p 2) £]| < K and 1 < 
|outi[pi,p 2 ]| < K. In the sequel, we let i 2 be p 2 
and i\ be ki. We let a = out3[pi,p 2 ] and a' = 
out3[ji, j 2 ]. The situation is depicted on Figure [131 
We distinguish five cases: 

i) if a ^ e, we conclude easily by show- 
ing that /i(outi[ji, j 2 ]) ~ /x(outi[pi,p 2 ]) ~ 
fi(ont 3 [k 1 ,k 2 \) ~ /i(out 3 [pi,p 2 ]). 

ii) if a = e and |out3[j 2 ,px]| > we can 
identify a loop in p, included in the interval 
[j2,Pi], such that out,3 is non-empty on this 
loop. We can then derive the result. 

iii) if a = e, |out3[j 2 ,pi]| < K and 
a' ^ e, then we can show that 
fi(outi\ji,j 2 ]) ~ |u(outi[pi,p 2 ]) ~ 
^(out 3 [fci,fc 2 ]) - fj,(out s \jx,j 2 ]), and 
conclude as the output out3[j 2 ,p 2 ] has length 
less than K (t^ can be defined so as to cover 
these words). 

iv) if a = e, |out3[j 2 ,pi]| < K, a' = e and 
out3[fc 2 ,ji]| > K, we can identify a loop 
inside the interval [fc 2 ,ji]. This loop can be 
used to prove the result, as we know that the 
length of the word out3 [ji , p 2 ] is less than K. 

v) else, i.e. if a = e, |out3[? 2 ,pi]| < K, a 1 — e 
and out3[/c 2 , ji} \ < K, then we are done as £3 
can be defined as out 3 [/c 2 ,p 2 ]. 

■ 

Construction of T" from T 

We provide here some additional details for the definition 
of the NFT T" from the eZNFT T. 

First, the transducer T" should, in a single forward pass, 
simulate the three passes (forward, backward, and forward) 
of T 1 . Therefore it maintains a triple of states of T 1 and the 
current symbol. 

Second, it uses three modes: before the guess of position 
li, between positions l\ and £ 2 , and after position l 2 . 

Third, it should guess the words of bounded length t\, t 2 
and £3, and two additional words x and y of bounded length 
(< 3. if ) which intuitively correspond to words out3[l, l{\ and 
outi[^ 2 ,n] (see property P 2 ). 

Last, it verifies in the different modes that the output 
has the expected form, and produces in a forward manner 
the overall output word. Therefore it distinguishes between 
different cases, whether t\ is a prefix of outi [£1, £2} or whether 



< K 



out 3 



outi 















w 


e 












e 


















v- 

< K 





1 ki = d k 2 ' jl h Pi P2 = f-2 ' 

Fig. 15. Decomposition of the output for case II.3).b) 



t\ also covers outlaw] or out3[l, or even out 3 [^i,f 2 ]- It 
manipulates pointers in the different words of bounded length 
it has guessed to verify the form of the output, and to produce 
the correct output, as we did in the construction of T' . 

Appendix C 
Lower Bound 

Lemma 19. (2DFT, NFT)-definability is PSpace-Hard. 

Proof: Consider n DFAs A\, . . . ,A n . Let us define the 
following transduction (where # ^ £): 

ul if u = #ui#u 2 # and u 2 G f|i L(A) 
undefined otherwise. 



Clearly, T is definable by a 2DFT. It suffices to first perform 
n back and forth non-producing passes on u to determine 
whether m 2 G f] { L(Ai), and then a last backward pass to 
reverse U\. 

Then, T is jVfT-definable iff dom(T) = iff f| 4 L(A) = 
0. Indeed, if dom(T) = then T is obviously NFT- 
definable. Otherwise, there exists w 2 G Hi L(Ai), and there- 
fore #E*#u 2 # C dom(T). If T is WT-definable, then so 
would be the reverse operation. Contradiction. ■ 



